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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Hillman, Jennifer L . 

Goli, Surya K. 

(ii) TITLE OF THE INVENTION: NOVEL HUMAN MLS3 PROTEIN 



(iii) NUMBER OF SEQUENCES: 6 

M : (iv) CORRESPONDENCE ADDRESS: 

O (A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

O (B) STREET: 3174 Porter Drive 

flj (C) CITY: Palo Alto 

hj (D) STATE: CA 

;jj (E) COUNTRY: USA 

fif (F) ZIP: 94304 

. 

J* (v) COMPUTER READABLE FORM: 
Li;. (A) MEDIUM TYPE: Diskette 

pi (B) COMPUTER: IBM Compatible 

r: (C) OPERATING SYSTEM: DOS 

["J (D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 
^ (A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE /DOCKET NUMBER: PF-0223 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-855-0555 

(B) TELEFAX: 415-845-4166 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BRAITUT02 

(B) CLONE: 762280 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Phe Arg Phe Met Arg Asp Val Glu Pro Glu Asp Pro Met Phe Leu 

15 10 15 

Met Asp Pro Phe Ala lie His Arg Gin His Met Ser Arg Met Leu Ser 

20 25 30 

Gly Gly Phe Gly Tyr Ser Pro Phe Leu Ser lie Thr Asp Gly Asn Met 

35 40 45 

Pro Gly Thr Arg Ala Ala Ser Arg Arg Met Gin Gin Ala Gly Ala Val 

50 55 60 

Xaa Pro Phe Gly Xaa Leu Gly Met Ser Gly Gly Phe Met Asp Met Phe 
65 70 75 80 

Gly Met Met Asn Asp Met Xaa Gly Asn Met Glu His Met Thr Ala Gly 

85 90 95 

Gly Asn Cys Gin Thr Phe Ser Ser Ser Thr Val lie Ser Tyr Ser Asn 

100 105 110 

Thr Gly Asp Gly Ala Pro Lys Val Tyr Gin Glu Thr Ser Glu Met Arg 

115 120 125 

Ser Ala Pro Gly Gly lie Arg Glu Thr Arg Arg Thr Val Arg Asp Ser 

130 135 140 

Asp Ser Gly Leu Glu Gin Met Ser lie Gly His His lie Arg Asp Arg 
145 150 155 160 

Ala His lie Leu Gin Arg Ser Arg Asn His Arg Thr Gly Asp Gin Glu 

165 170 175 

Glu Arg Gin Asp Tyr lie Asn Leu Asp Glu Ser Glu Ala Ala Ala Phe 

180 185 190 

Asp Asp Glu Trp Arg Arg Glu Thr Ser Arg Phe Arg Gin Gin Arg Pro 

195 200 205 

Leu Glu Phe Arg Arg Leu Glu Ser Ser Gly Ala Gly Gly Arg Arg Ala 

210 215 220 

Glu Gly Pro Pro Arg Leu Ala lie Gin Gly Pro Glu Asp Ser Leu Pro 
225 230 235 240 

Asp Ser Pro Ala Ala Met Thr Gly Glu Gly Pro Gly Ala Ser Ala Leu 

245 250 255 

Leu Tyr Arg Leu Arg Gly 
260 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1322 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BRAITUT02 

(B) CLONE; 762280 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

GGGGGGCGTA CGGAGGTGGC AGCTGTGGGA GGAGGCGGCG TGGAAGGCCG AGGAGCTCAA 60 

GCCCGGACCA ATCCCCACGT TCCGGGCCGC CACCCTGACC CTGCAGCGTA CCGGGAAGCG 12 0 

AAACCGGCCG GATGGGCCGC TGAGCCCGAA TCGGGCACTG TGTGGAGCCC CCTGGAGCTG 180 

AGATCAGGAT GTTCCGCTTC ATGAGGGACG TGGAGCCTGA GGATCCCATG TTCCTGATGG 240 

ATCCCTTTGC TATTCACCGT CAGCATATGA GCCGTATGTT GTCAGGTGGC TTTGGATATA 300 

GCCCCTTCCT CAGCATCACA GATGGCAACA TGCCAGGGAC CAGGGC TGCC AGCCGCCGGA 3 60 

TGCAGCAGGC TGGAGCTGTC TNCCCCTTTG GGNTGC TGGG AATGTCGGGT GGTTTCATGG 42 0 

ACATGTTTGG GATGATGAAT GACATGNTTG GAAACATGGA ACACATGACA GCTGGAGGCA 480 

ATTGCCAGAC CTTCTCATCT TCCACTGTCA TCTCCTACTC CAATACGGGT GATGGTGCCC 540 

CCAAGGTCTA CCAAGAGACA TCAGAGATGC GCTCGGCACC AGGCGGGATC CGGGAGACAC 600 

GGAGGACTGT TCGGGATTCA GACAGTGGAC TGGAGCAGAT GTCCATTGGG CATCACATCC 660 

GGGACAGGGC TCACATCCTC CAGCGCTCCC GAAACCATCG CACGGGGGAC CAGGAGGAGC 72 0 

GGCAGGACTA TATCAACCTG GATGAGAGTG AGGCCGCAGC GTTTGATGAC GAGTGGCGGC 780 

GGGAGACCTC CCGATTCCGG CAGCAGCGTC CCCTGGAGTT TCGGCGGCTT GAGTCCTCAG 840 

GGGCTGGGGG ACGAAGGGCG GAGGGGCCTC CCCGCCTGGC CATCCAGGGA CCTGAGGACT 900 

CCCTTCCCGA CAGTCCCGCC GCTATGACTG GTGAGGGCCC CGGGGCCTCA GCTCTCTTGT 960 

ACAGGCTGAG AGGCTGAGAA ATCATCCCCT GAATAACTTT TTCCTCTCGA TTCCCATCCC 102 0 

CAATTTAATA TTAAATTAAC AGGCAAGCCG GCCCCCACCT CTCCCTGGGG GTCTCAGGGA 1080 

GAACCTTTCA CGGCACCCTT TCCCTACCTT TTCCTTCTTT AATCTCCTGG TTTACCATTG 1140 

ATGACTTCGG CTCTGCATCT ACTTACTTGA TTTTTCATTC TGCCACTTCA TCTTCAAACC 12 00 

CCCTCACCTT TCCCATCCTA CTCCTGCCAT GCATTGAAGG GTCAATGCAT TTTGGGGTGA 1260 

GNTTNGGTTT AGGGGCCCCT TCATNCCTNA GCTACCTGGG TCTTTGCCCA ACTTTTCTCA 132 0 

GA 1322 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 1066392 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Phe Arg Met Leu Asn Ser Ser Phe Glu Asp Asp Pro Phe Phe Ser 

15 10 15 

Glu Ser lie Leu Ala His Arg Glu Asn Met Arg Gin Met lie Arg Ser 

20 25 30 

Phe Ser Glu Pro Phe Gly Arg Asp Leu Leu Ser lie Ser Asp Gly Arg 

35 40 45 

Gly Arg Ala His Asn Arg Arg Gly His Asn Asp Gly Glu Asp Ser Leu 

50 55 60 

Thr His Thr Asp Val Ser Ser Phe Gin Thr Met Asp Gin Met Val Ser 
65 70 75 80 

Asn Met Arg Asn Tyr Met Gin Lys Leu Glu Arg Asn Phe Gly Gin Leu 

85 90 95 

Ser Val Asp Pro Asn Gly His Ser Phe Cys Ser Ser Ser Val Met Thr 
100 105 110 
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Tyr Ser Lys 
115 

Gin Thr Arg 
130 

Arg Asp Ser 
145 

His Asp Arg 

Asp Glu Glu 

His Ala Phe 
195 

Gly Arg His 

210 
Asn Pro Gly 
225 

Pro Ala lie 
His lie Lys 



(2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1116 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE : 1066391 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



GTTATGTGTT 


CCCGTCCGTA 


CTGGAGGCTA 


GCTCTTGTCG 


CGGCCGCGGC 


GAGTTAACAT 


60 


CGTTTTTCCA 


ATCTGTCCGC 


GGCTGCCGCC 


ACCCAAGACA 


GAGCCAGAAT 


GTTCAGGATG 


120 


CTGAACAGCA 


GTTTTGAGGA 


TGACCCCTTC 


TTCTCTGAGT 


CCATTC TTGC 


ACACCGAGAA 


180 


AATATGCGAC 


AGATGATAAG 


AAGTTTTTCT 


GAACCCTTTG 


GAAGAGACTT 


GCTCAGTATC 


240 


TCTGATGGTA 


GAGGGAGAGC 


TCATAATCGT 


AGAGGACATA 


ATGATGGTGA 


AG ATTC TTTG 


300 


ACTCATACAG 


ATGTCAGCTC 


TTTCCAGACC 


ATGG AC C AAA 


TGGTGTCAAA 


TATGAGAAAC 


360 


TATATGCAGA 


AATTAGAAAG 


AAACTTCGGT 


CAACTTTCAG 


TGGATC C AAA 


TGGACATTCA 


420 


TTTTGTTCTT 


CCTCAGTTAT 


GACTTATTCC 


AAAATAGGAG 


ATGAACCGCC 


AAAGGTTTTT 


480 


CAGGCCTCAA 


CTCAAACTCG 


TCGAGCTCCA 


GGAGGAATAA 


AGGAAACCAG 


GAAAGCAATG 


540 


AGAGATTCTG 


AC AGTGG AC T 


AGAAAAAATG 


GCTATTGGTC 


ATCATATCCA 


TGACCGAGCT 


600 


CATGTCATTA 


AAAAGTCAAA 


GAACAAGAAG 


ACTGGAGATG 


AAGAGGTCAA 


CCAGGAGTTC 


660 


ATCAATATGA 


ATGAAAGCGA 


TGCTCATGCT 


TTTGATGAGG 


AGTGGCAAAG 


TGAGGTTTTG 


720 


AAGTACAAAC 


CAGGACGACA 


CAATCTAGGA 


AACACTAGAA 


TGAGAAGTGT 


TGGCCATGAG 


780 


AATCCTGGCT 


CCCGAGAACT 


TAAAAGAAGG 


GAGAAACCTC 


AACAAAGTCC 


AGCCATTGAA 


840 


CATGGAAGGA 


GATCAAATGT 


TTTGGGGGAC 


AAACTCCACA 


TCAAAGGCTC 


ATCTGTGAAA 


900 


AGCAACAAAA 


AATAAATAGC 


CATGCATTTG 


ATTTGTTTAG 


TTTTGATTGT 


TTTAACAGTT 


960 


AGTAATGGTG 


CTGGGTAATA 


AGCATAAGAC 


CAATCTCTTG 


CTGTTAAATC 


AGTTCTGTCC 


1020 


TTGGCAACTT 


TCTTCTGATA 


TCTGAATGTT 


CATGAAGGTC 


CTAGCTTTAT 


ATTGTCCCTC 


1080 


TTTTAGGAAT 


AAAATTTTGA 


TTTTCAACAA 


AAAAAA 






1116 
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lie Gly Asp Glu Pro Pro Lys Val Phe Gin Ala Ser Thr 

120 125 
Arg Ala Pro Gly Gly lie Lys Glu Thr Arg Lys Ala Met 

135 140 
Asp Ser Gly Leu Glu Lys Met Ala lie Gly His His lie 
150 155 160 

Ala His Val lie Lys Lys Ser Lys Asn Lys Lys Thr Gly 

165 170 175 

Val Asn Gin Glu Phe lie Asn Met Asn Glu Ser Asp Ala 
180 185 190 

Asp Glu Glu Trp Gin Ser Glu Val Leu Lys Tyr Lys Pro 

200 205 
Asn Leu Gly Asn Thr Arg Met Arg Ser Val Gly His Glu 

215 220 
Ser Arg Glu Leu Lys Arg Arg Glu Lys Pro Gin Gin Ser 
230 235 240 

Glu His Gly Arg Arg Ser Asn Val Leu Gly Asp Lys Leu 

245 250 255 

Gly Ser Ser Val Lys Ser Asn Lys Lys 
260 265 

) INFORMATION FOR SEQ ID NO: 4: 
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(2) INFORMATION FOR SEQ ID NO : 5 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 1399745 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 



Met Phe Arg Phe Met Arg Asp Val Glu Pro Glu Asp Pro Met Phe Leu 

15 10 15 

Met Asp Pro Phe Ala lie His Arg Gin His Met Ser Arg Met Leu Ser 

20 25 30 

Gly Gly Phe Gly Tyr Ser Pro Phe Leu Ser lie Thr Asp Gly Asn Met 

35 40 45 

Pro Gly Thr Arg Pro Ala Ser Arg Arg Met Gin Gin Ala Gly Ala Val 

50 55 60 

Ser Pro Phe Gly Met Leu Gly Met Ser Gly Gly Phe Met Asp Met Phe 
65 70 75 80 

Gly Met Met Asn Asp Met lie Gly Asn Met Glu His Met Thr Ala Gly 

85 90 95 

Gly Asn Cys Gin Thr Phe Ser Ser Ser Thr Val lie Ser Tyr Ser Asn 

100 105 110 

Thr Gly Asp Gly Ala Pro Lys Val Tyr Gin Glu Thr Ser Glu Met Arg 

115 120 125 

Ser Ala Pro Gly Gly He Arg Glu Thr Arg Arg Thr Val Arg Asp Ser 

130 135 140 

Asp Ser Gly Leu Glu Gin Met Ser He Gly His His He Arg Asp Arg 
145 150 155 160 

Ala His He Leu Gin Arg Ser Arg Asn His Arg Thr Gly Asp Gin Glu 

165 170 175 

Glu Arg Gin Asp Tyr He Asn Leu Asp Glu Ser Glu Ala Ala Ala Phe 

180 185 190 

Asp Asp Glu Trp Arg Arg Glu Thr Ser Arg Phe Arg Gin Gin Arg Pro 

195 200 205 

Leu Glu Phe Arg Arg Leu Glu Ser Ser Gly Ala Gly Gly Arg Arg Ala 

210 215 220 

Glu Gly Pro Pro Arg Leu Ala He Gin Gly Pro Glu Asp Ser Pro Ser 
225 230 235 240 

Arg Gin Ser Arg Arg Tyr Asp Trp 
245 



(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
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(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 1399744 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

CTCTAAAGGG CAGCTGTGGG AGGAGGCGGC GTGGAAGGCC GAGGAGCTCA AGCCCGGACC 60 

AATCCCCACG TTCCGGGCCG CGACCCTGAC CCTGCAGCGT AC CGGG AAGC GAAACCGGCC 12 0 

GGATGGGCCG CTGAGCCCGA ATCGGGCACT GTGTGGAGCC CCCTGGAGCT GAGATCAGGA 18 0 

TGTTCCGCTT CATGAGGGAC GTGGAGCCTG AGGATCCCAT GTTCCTGATG GATCCCTTTG 240 

CTATTCACCG TCAGCATATG AGCCGTATGT TGTCAGGTGG CTTTGGATAT AGCCCCTTCC 3 00 

TCAGCATCAC AGATGGCAAC ATGCCAGGGA CCAGGCCTGC CAGCCGCCGG ATGCAGCAGG 3 60 

CTGGAGCTGT CTCCCCCTTT GGGATGC TGG GAATGTCGGG TGGTTTCATG GACATGTTTG 42 0 

GGATGATGAA TGACATGATT GGAAACATGG AACACATGAC AGCTGGAGGC AATTGC C AG A 48 0 

CCTTCTCATC TTCCACTGTC ATCTCCTACT CCAATACGGG TGATGGTGC C CCCAAGGTCT 54 0 

AC C AAGAGAC ATCAGAGATG CGCTCGGCAC CAGGCGGGAT CCGGGAGACA CGGAGGACTG 6 00 

TTCGGGATTC AGACAGTGGA CTGGAGCAGA TGTCCATTGG GCATCACATC CGGGACAGGG 6 60 

CTCACATCCT CCAGCGCTCC C GAAAC CATC GCACGGGGGA CCAGGAGGAG CGGCAGGACT 72 0 

ATATCAACCT GGATGAGAGT GAGGCCGCAG CGTTTGATGA CGAGTGGCGG CGGGAGACCT 780 

CCCGATTCCG GCAGCAGCGT CCCCTGGAGT TTCGGCGGCT TGAGTCCTCA GGGGCTGGGG 84 0 

GACGAAGGGC GGAGGGGCCT CCCCGCCTGG CCATCCAGGG AC C TGAGG AC TCCCCTTCCC 9 00 

GACAGTCCCG CCGCTATGAC TGGTGAGGGC CCCGGGCCCT CAGCCTCTCT TGTACAGGCT 9 60 

GAGAGGCTGA GAAATCATCC CCTGAATAAC TTTTTCCTCT CGATTCCCAT CCCCAATTTA 102 0 

ATATTAAATT AACAGGCAAG CCGGCCCCCA CCTCTCCCTG GGGGTCTCAG GGAGAACCTT 1080 

TCACGGCACC CTTTCCCTAC CTTTTCCTTC TTTAATCTCC TGGTTTACCA TTGATGACTT 1140 

CGCCTCTGCA TCTACTGACT TGATTTTTCA TTCTGCCACT CCATCTTCAA ACCCCCTCAC 12 00 

CTTTCCCATC CTACTCCTGC CATGCATTGA AGGGTCAATG CATTTTGGGG TGAGCTCTGG 12 60 

GTTTAGGGGC CCCCTCCATC CCTCAGCTAC CCTGGATCTT TGCCCACCTC TTCCTCAGAG 13 2 0 

CCCCCACTGA GGGGCCGTAG CC C TATC TAG GGCTGTGGAA GGAGCAGACT GGTTCCTAAC 13 80 

TCTCTCCCTC CTCCTGCCCA CACACATCAA AAGAATCTTC CCTACACCCT TCTCTGCCTT 144 0 

TATTTTTTGA TTTGTGCAAC TTGTAAC TAG GTGTTTATGG AATAAAGGAG AATGGAAAAA 1500 

AG 1502 
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